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We present an approach to type theory in which the typing judgments do not have explicit contexts. 
Instead of judgments of shape r h A : B, our systems just have judgments of shape A : B. A key 
feature is that we distinguish free and bound variables even in pseudo-terms. 

Specifically we give the rules of the 'Pure Type System' class of type theories in this style. We 
prove that the typing judgments of these systems correspond in a natural way with those of Pure 
Type Systems as traditionally formulated. I.e., our systems have exactly the same well-typed terms 
as traditional presentations of type theory. 

Our system can be seen as a type theory in which all type judgments share an identical, infinite, 
typing context that has infinitely many variables for each possible type. For this reason we call our 
system Too. This name means to suggest that our type judgment A : B should be read as Too h A : B, 
with a fixed infinite type context called T^. 

1 Introduction 
1.1 Problem 

One of the important insights type theory gives us is the need to be aware of the context in which one 
works. This was already stressed by de Bruijn in his 1979 paper Wees contextbewust in WOT [6], Dutch 
for "Be context aware in the mathematical vernacular". In type theory a term always is considered with 
respect to a context T, which gives the types of the variables occurring free in the term. This is also 
apparent in the shape r h M : A of the judgments of type theory, where the context T is made explicit. 
Thus a 'bound' variable is bound locally in a term, while a 'free' variable actually is globally bound, 
namely by the context. 

In customary presentations of first order predicate logic lfl6l l22l for example], and in fact in the 
presentation of most other logics as well, free variables are not treated in such a way. In these logics free 
variables are really free. They are taken from an infinite supply of variables that are just available to be 
used in formulas and terms, without them having to be declared first. 

This difference between type theory and predicate logic means that when we model predicate logic 
in type theory, actually we do not get the customary version of predicate logic, but instead get a version 
called free logic lfl2l . In traditional treatments the formula (Vjc.P(jc)) — > (3x.P(x)) is usually provable. 
For instance a natural deduction proof of this formula would look like: 
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[Vx.P(x)} 



3x.P(x) 
(Vx.P(jc)) (3x.P(x)) 

This proof uses the free variable y. If one cannot use any other variables than those introduced by earlier 
rules, then this proof fails. Indeed, the type corresponding to the formula is not inhabited: there is no 
term M such that the following judgment is derivable: 

D:*,P:D^*\-M:(Ux: D.P{x)) -> (Lx : D.P(x)) 

because we cannot avoid the case in which the domain D is empty, where the formula is false. 

Now there are two things one can do to bridge this gap between type theory and traditional logic: 

• Make predicate logic more like type theory, by explicitly keeping track in the judgments of the set 
of variables that can be used in the proof. 

• Make type theory more like predicate logic, by having a version of type theory that does not need 
contexts in the judgments, i.e., in which free variables are just taken from some infinite supply. 

Although the first option is interesting too, especially in categorical treatments of logic ifTTL for example], 
in this paper we focus on the second. We originally thought that the dependent types in type theory would 
prevent a version of type theory without contexts from being a viable option, but to our surprise it turns 
out that one can present type theory in a style where there are no contexts and in which therefore free 
variables are really free, provided we are prepared to pay the small price of labelling variables in a rather 
involved manner. 

In those type theories actually implemented in interactive theorem provers, the context always con- 
sists of a part holding global definitions and parameters and a part holding the free variables in the term 
(as in |[T9ll2Tl for example]). For simplicity of exposition, and for the sake of proving an exact corre- 
spondence between a standard presentation of type theory and the variant we propose, in this paper we 
consider only the second part of such contexts. We believe, however, that the other part can be treated in 
exactly the same way. 

There is another reason why it is interesting to look at a version of type theory where there are no 
explicit contexts. One of the most popular architectures for proof assistants is the LCF architecture, 
named after the LCF system from the seventies [9]. In the original form of such a system there is an 
abstract data type called term, whose elements can only be created by a small number of functions 
exported from the type-checking kernel. Elements of this datatype always correspond to type-correct 
terms, and those terms can contain free variables. 

A system using this approach has a kernel interface containing a function: 
app : term * term -> term 
When this function is called, the kernel of the system makes sure that the types of the arguments are 
compatible, i.e., that the result is again a type-correct term. 

This is how the HOL family of theorem provers is implemented. These systems have a logical 
foundation that is based on a typed lambda calculus. However, in these systems the free variables in 
the terms are not recorded in a context of variables. The only context in these systems is the context 
of definitions, which is kept track of in a stateful variable. Definitions are never allowed to be removed 
from this context, as that would compromise the safety of the kernel. Hence, these systems are stateful, 
although they can be made functional using a variant of the approach presented here ll24ll . 
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There are two classes of systems that can be said to implement a type theory, a typed lambda calculus: 

• The simpler type theories, in which no dependent types are allowed. They often are a form of 
simple type theory with some enhancements, such as some form of polymorphism or type classes. 
These include the systems from the HOL family: HOL4, HOL Light, ProofPower and Isabelle. 
These systems can be, and are, implemented following the LCF architecture just outlined. In these 
systems variables come from an infinite 'sea' of free variables, and in the logical theory there is no 
context keeping track of the variables. 

• More advanced type theories, often called type theory with dependent types. These come from the 
Dutch Automath systems, the Swedish tradition of Martin-Lof type theory, the French tradition 
of the Calculus of Constructions and variations on the Edinburgh Logical Framework. Their im- 
plementations include Coq, NuPRL, Twelf, Agda, Lego, Plastic and Epigram. In the logical theory 
of these systems there is a context keeping track of the variables. 

For this second style of type theory the pure LCF approach is not attractive. The app function will need 
to check whether the contexts in which the terms live are compatible, which will be very expensive, if it 
needs to be done for the type-checking of each function application. 

For this reason actual type-theoretical proof assistants have a kernel with a different kind of interface. 
In such a kernel there is no abstract datatype of terms (there just is a — non-abstract — type of pseudo- 
terms). Instead there is an abstract type of well-typed contexts, which we call context here. (There is 
also a type pseudocontext, but that is irrelevant here.) The interface then looks like: 

mkApp : pseudoterm * pseudoterm -> pseudoterm 
add_constant : string * pseudoterm -> context -> context 

where mkApp is just the constructor of the data type of pseudoterms, whereas add_constant is a function 
that does the type-checking: a pseudoterm will only be added to the context after it has been type- 
checked. (These are the actual names of the functions in the kernel of the Coq system. The types of 
those functions in Coq are essentially what is presented here. The type pseudoterm is called constr in 
Coq, while the type context is called saf e_environment.) The system also has a global variable 

global_env : context ref 

corresponding to the state of the system in which the user works. It is not part of the kernel (and in fact is 
changed back by an undo operation), but as there are no interesting operations combining two different 
contexts, only one global context is ever relevant, the one given by the contents of this variable. 

Although the architecture with contexts that we described is purely functional (as is the Coq kernel), 
the fact that the actual implementation has this global variable means that it is used in a rather 'stateful' 
way. The desire to investigate a possible LCF-style kernel for type theory that is 'less stateful' motivated 
this research. In the conclusions we will address the question whether the style of type theory we present 
here will lead to such a type-checking architecture. 

1.2 Approach 

The approach we will follow here is to imagine there to be an 'infinite context' called Too. For each type- 
correct type A this context will have infinitely many variables xf. It should be stressed that this A should 
be considered to just be a label, a string. Reduction will never happen inside these As. Also, xf and xf 
will be different variables, even when A and B are convertible, or even if they are a-equivalent. Note that 
the (free) variables in A themselves will also be of shape y? : this means that the variables themselves, as 
well as the terms, have a recursive tree-like structure. 
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For example, a variable corresponding to the successor function on natural numbers looks like: 
If we use numbered names for the variables, this might become: 



So the "small price" alluded to above is that a free variable xf in a well-typed term carries with it the 
(well-founded) history of how it comes to be well-typed; that is, the label A witnesses the validity of the 
context extension r,x, : A. 

Now our systems will have judgments A : B, which should be interpreted as Too h A : B. For this 
reason we call the general approach to type theory introduced here Too' (reusing the name of the context 
as the name of the system). Note that Too is not a single system: each type theory will have a Too-variant'. 

The Too approach has the essential feature that there are two different classes of variables. There 
are the variables that come from the Too context (the 'free' variables), and then there is another kind of 
'bound' variables. When thinking about our systems one might imagine de Bruijn indices for the bound 
variables, although the presentation we give here uses named variables for them as well. 

Although we expect many type theories to have a Too-variant, here we only consider the class of type 
theories called Pure Type Systems (PTSs). That way we keep everything concrete, and it allows us 
to prove a precise correspondence between PTSs in the traditional style and our version in Too style. 

One should note that in Too any type will be inhabited, simply because there are free variables of every 
type: in particular, just as in traditional treatments of logic, all domains are assumed to be inhabited. 
This is in essence the same as in the version with contexts, except that such variables are there explicitly 
recorded in the context. 

1.3 Related Work 

To our surprise, we found little published work investigating such an approach to dependent type theory. 
In Church's original formulation of simple type theory 0, variables, both free and bound, are anno- 
tated with their types, writing for example Xx a .f a ^ a x a . (whereas in our formulation we would write 
Xx:a*.f a *^ a * x.) Girard adopted 'Church-style' in the account of System F in his thesis [8j. In neither 
system do term variables occur in types, while type variables are not regulated by an explicit context. In 
these non-dependent type theories, contexts are not strictly needed, because one can define the different 
syntactic classes — types, terms — in stages. One can regard our approach as extending that of Church 
to dependent types, but optimised to avoid the need to consider substitution in labels on bound variables 
which otherwise might arise in the application rule. 

Conor McBride (private communication) observed that Pollack's LEGO implementation already 
supported the Too idea, and this idea was then used in his OLEG extensions, and subsequently in the 
architecture of EPIGRAM 1 . However, this approach has not been treated theoretically as we do here. 

The explicit distinction between free and bound variables on a syntactic level already can be found in 
lfl31 . The motivation there was to avoid capture during substitution while keeping a close correspondence 
with the informal presentation of PTSs with named variables, rather than how to give a Too' presentation, 
as here. Various approaches to representing binding are discussed in ll23ll . which considers named free 
variables and de Bruijn index bound variables one of the best options for mechanisation. Indeed, in 
ongoing work [ 10] the second author has formalised one half of the correspondence proved in Section [4] 
in such a style. We expand on the niceties of this formalisation in Section|5] 
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Elsewhere, Pollack considered presentations of type theory separating the typing judgment from that 
for context well-formedness 11 7~il. although judgments are still 'in context'. This allows a subtle range of 
issues to be explored, especially regarding closure under a-conversion, here treated only informally. 

Most significantly, but starting from a rather different point, Sacerdoti Coen considered the problem 
of proof-checking in the setting of a distributed library, and hence the problem of how to reconstruct a 
context in which a given term may be successfully type-checked lfl9l . This work (elaborated in |[T8ll . and 
forming the basis of the Matita system) goes beyond the standard PTS setting considered in this paper. 
It identifies a subtle problem which arises when attempting to merge contexts (including definitions) in 
the presence of global constraints (such as universe levels). 

Added in proof. Between acceptance of the final version of this paper and this final version, our 
attention was drawn to the recent work of Matthias Boespflug (H, which itself references an earlier 
(2009) account of our ideas. By contrast with our presentation, which is purely first-order, Boespflug 
uses higher-order abstract syntax (HOAS), with a view to implementation. 



1.4 Contribution 



We present a different approach to type theory, much closer to the way logical systems usually are 
presented than the standard presentation, in which free variables are not bound in a finite context but are 
taken to be really free. 

We validate our approach by proving two theorems, 13 and 19 below, establishing a straightforward 
correspondence between the standard presentation and the variant presented here. 



1.5 Outline 

The structure of the paper is as follows. In Section [2] we recall the PTS rules and some of its theory. In 
Section [3] we present the Too-variant of the PTS rules, in which the judgments do not have contexts. In 
Section|4]we show that both systems correspond to each other in a natural way. In Section[5]we conclude 
with a prospectus for an implementation based on our variant of the PTS rules. 



2 Pure Type Systems in the traditional style 

Pure Type Systems (PTSs) generalize many existing type systems and thus the class of PTSs contains 
various well-known systems, like systems F and Fco, dependent type theory AP and the Calculus of 
Constructions. 

Definition 1. For 5? a set {the set of sorts), stf C 5? x 5? {the set of axioms) and 2% C 5? x x ,5? 
{the set of rules), the Pure Type System X{5^ \s4 ,M) is the typed X-calculus with inference rules as in 
Figure^ In the rules, the expressions M,N,A,B,C are taken from the set of pseudo-terms & defined by 

sr ::=s | r I yyv-.st.st \ xr-.^.sr \ srsr. 

with y a set of variables, and the Y taken from the set of pseudo-contexts 

x\ : Ai,...,x n : A n (x,- e y,A,- e5,l<i<«) 

with the Xi all distinct. {We leave the choice of variable names unspecified at this point, as this does not 
matter as long as Y is countably infinite, but below we will take a specific choice of names.) 
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Figure 1: Typing rules for PTSs 



There is a lot of theory about PTSs and various systems have been studied in the context of PTSs. 
We do not give a complete overview but refer to EH 13 for details and explanation. Here we just give 
the results that we use in the rest of the paper to prove the equivalence between a PTS and its Too-variant. 

Definition 2. The pseudo-term A is called well-typed if a pseudo-context T and pseudo-term B exist such 
that T \- A . B orT\- B . A is derivable. A pseudo-context T is well-formed if pseudo-terms A and B exist 
such that r h A : B is derivable; a context is a well-formed pseudo-context. The set of variables declared 
in pseudo-context T is called the domain ofT, dom(r). For x G dom(r), let type r (x) denote the 'type' 
assigned to x in T: ifx : A G T, then type r (x) = A. The expression type(r) denotes the set of such 'types' 
occurring in T. The set of well-typed terms ofX( r y,g/,&) is denoted by Term(?i(y ,g/ ,&)). 

We adopt the usual notions of bound and free variable, a-conversion (=), substitution (B[x := N], 
used in the rule (app)), /3-reduction (— >p) and j3 -equality (=p, used in the rule (conv)) on pseudo-terms. 

The following are well-known properties of PTSs. The relation F C A denotes inclusion between T 
and A regarded as sets of variable assignments. The third, Permutation, is a corollary of Strengthening. 

Proposition 3. 

Thinning IfT\- M :A and A^Tis well-formed, then A h M : A. 

Strengthening IfF,x : B,Ah M:Aandx<£ FV(type(A), M,A), then T, A h M : A. 

Permutation IfT,x : B,y : C, A h M : A and x (£ FV(C), then T,y : C,x : B, A h M : A. 

In proving the equivalence between a PTS and its Too-variant, we need to merge two contexts to 
create a new one. Therefore we introduce the following: 
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Definition 4. Let T and A be two pseudo-contexts. We say T and A are compatible, notation T \ \ | A, ;/ 

\/x G dom(r) ndom(A)(type r (x) = type A (jc)). 

The merge of T and A, notation T k A, is the pseudo-context T, (A \ T). This is T followed by the 
declarations x.B€L Afar which x ^ dom(r). 

Note the strong requirement in T 1 1 1 A that the types of x in T and A should be a-equal, and not just 
j3 -convertible. 

Lemma 5.IfT and A are contexts and T\\ \ A, then T ix A is well-formed. 

Proof. We write x\ : B\, . . . ,x n : B n for A. T x A is the pseudo-context T, (A \ T). As T is well-formed, 
we only have to consider the part A \ T = x^ : B ;i , . . . ,Xi n : But x\ :B\,...,Xi l -\ : -B^-i C r, so by 
Thinning Y h B (1 : 5 for some sort 5, so r,x (1 : Bi l is well-formed. 

The same reasoning applies to x,- 2 : S, 2 , . . . ,x, n : , so we conclude that T, (A \ T) is well-formed. □ 

In our system Too, the free variables will have special names, as they are labelled by their types. Of 
course, consistently renaming the free variables in a judgment r h M : A does not change its meaning, as 
the free variables in M and A are actually bound in T. For clarity, we introduce this notion explicitly. 

Definition 6. The judgment Th M : A is a-equivalent to V h M 1 : A' in case one can be obtained from 
the other by renaming bound variables, where we consider the free variables in M and A to be bound by 
their declaration in T. 

3 Pure Type Systems in the Too style 

We now make the set of variables 'f explicit. We have two kinds: variables x with a dot on top, intended 
to be bound by X and IT binders; and variables x A tagged with a pseudo-term A, intended to be bound in 
the context. This means we take Y in Definition [T] as follows, where X supplies the names of variables: 

V ::= St\X sr 
X ::= x I y \ z \ ■ ■ ■ \ xq \ x\ \ X2 \ ■ ■ ■ 

Clearly the rules for !7 and "f are mutually recursive. 

Note that although the variables are intended to be used in a certain way (made precise in Defini- 
tion [10] below), in the PTSs as defined above both kinds of y can be used for all purposes. In particular 
x can be put in a context, x A can be bound, and the label A of x A need not correspond to the type of x A . 

Note also that the definition of substitution, and hence the relation of /3 -equality, is agnostic about 
the structure of the annotations of the variables. (The definition of = J g in Section[2]takes "V just as a set). 
This means that although 

(XA : *.A)B* =p B* 

we have that 

X {XA:*.A)B* 

We will now define the rules of T^. To do this we first have to introduce the notion of hereditarily 
free variables of the types of the free variables in a term. We first motivate the need for this notion. 

In a PTS, the context takes care that one can only abstract over a variable if nothing else depends on 
that variable. In the rules, this is formalised by requiring that the x : A abstracted over in the II or X rule 
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must be the last declaration in the context. This ensures that x does not occur free in any of the other 
types in the context. In r„ we do not have contexts, so we have to replace this by another side condition 
on the rules. We would like to have a n rule as follows: 

A : s\ B:s 2 



Uy:A.B[x A :=y] : s 3 

but that is wrong, because then we would be able to form (in PTS terminology) the IT-type 

... hA : * A : *,P : A— >*,Q : TbcA.P x— >*,a : A,h : Pah Qah : * 
A : *,P : A^*, Q : Ylx:A.Px^*,h -.PahYly: A.Qyh : * 

But this cannot be correct, because h is not of type Py in the conclusion. In r^, this derivation would 
read (adding some brackets for readability): 

A* : * Q n^*.(p^**H* a A* . ^ 

ny:A\Q m - A *-( pAt ^**)^*yh pA ^* aA * : * 

which would be derivable according to the Il-rule above, but clearly undesirable. 

Definition 7. Given M G 2T, we define the hereditarily free variables in M, denoted hfv(M), as follows: 

hfv(j) =hfv(jc) = 

hfv^) = {^}Uhfv(A) 

hfv(FA^) = hfv(F)Uhfv(AO 

hfv(Ai:A.A^) =hfv(m:A.A^) = hfv(A) Uhfv(A^) 

So, where =p basically ignores the structure of the type labels of free variables, we now take them 
seriously, collecting the variables (hereditarily) free in the type labels as well. 

We put as a side condition in the Il-rule that x A should not occur free in any of the types of the free 
variables in B, and similarly for the A rule. We give an explicit definition of this notion. 

Definition 8. Given M G S7,we define the hereditarily free variables of the types of the free variables in 
M, denoted hfvT(M), as follows: 

hfvT(s) = hfvT(x) = 

hfvT(x A ) = hfv(A) 
hfvT(FA0 = hfvT(F)UhfvT(AO 
hfvT (Ax:AJV) = hfvT(Hx:A.A^) = hfvT(A) U hfvT(A^) 

So, for example hfvT(/i />A ^*° A ) = {P A ^* ,a A ,A*}. An easy corollary of the definition is that 

hfvT(M) C hfv(M) 

We now give the derivation rules of the system. 

Definition 9. The derivation rules ofT^ are given by the inference rules in Figure^ The IT and X rules 
have the side condition that x should not be captured in B or M when doing the substitution. That is, x 
should not be bound by a binder under which y 4 occurs. 

The side condition on the IT and A rules is no restriction as you can go to an a-equivalent version of 
the term afterwards. 
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(sort) 
(var) 

(n) 

(app) 
(conv) 



si : s 2 

A:s 
~? :A 

A:s\ B : s 2 
YLx:A.B\/ :=x] : s 3 

M : B Ux:A.B\/ :=x]:s 



if (si,s 2 ) G 



AxA.M[/ := x] : rix:A.B[/ := i 
M : ITx:A.fi N:A 



if (ji,j 2 ,J3) Shandy 4 ^ hfvT(B) 
ify 4 ^hfvT(M)UhfvT(S) 



MN:B[x:=N] 
M : A B:^ 



A =/3 B 



Figure 2: Typing rules for Too 

4 The correspondence theorems 

We now prove that a PTS and its r^- variant correspond to each other. For a r„ judgment M : A we 
generate a context T such that r h M : A is PTS-derivable. Conversely, if T h M : A is PTS-derivable, we 
always have an a-equivalent judgment (see Definition [6]) PhM': A' such that M 1 : A' is derivable in T^. 
This specific form PhM': A' we call a ?y/?e annotated judgment. 

Definition 10. A type annotated context in a PTS is a context of the form 

x B 1 1 :B h ...,x B n ":B n 

where we moreover assume that all bound variables in the Bi are of the form x. 
Definition 11. A type annotated judgment in a PTS is one of the form 



x \ ■ Bl j • • • i^n " D n 

Br B 
where: x^ : B\ , . . . ,x"" : B n is a type annotated context; all free variables in M and A are of the form x- '; 

and all bound variables are of the form x. 

We now first show the easy direction of our correspondence result: 
Lemma 12. Every judgment T\- M : A in a PTS is a-equivalent to a type annotated judgment PhM': A'. 

Proof. From left to right we rename the variables in the context (and of course also in M and A) to the 
'standard' names 

v 8 ! ■ R r 5 " • 7? 

(Here x,- is not a meta-variable for a variable name, but really the explicit variable name "x," in 56 '.) We 
also a-rename any bound variable of the form x A to a fresh variable of the form x. □ 



x®" :B„\-M:A 
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As an example, consider the PTS judgment 

A : *, a : A h (Ax:A.x)a : A 

This does not fit the variable names from our Y, so this does not conform to the definitions in this paper. 
Instead using our variables it should be something like: 

A:*,a*:A\- {Xx B :A.x B )a* :A 

This of course is not a type annotated judgment, the annotations make no sense at all, but still this is a 
perfectly fine PTS judgment as defined in Definition [T] 

Now according to the theorem this is a-equivalent to a judgment that is type annotated. And it is, for 
example it is a-equivalent to 

h (Xx:x\.x)x 2 l 

Or, if one uses more readable names, to 

A* : *,a A * :A* h {XxA*.x)a A * 

The other part of the easy direction of our correspondence is that a type annotated PTS judgment 
essentially is the same as a Too judgment: 

Theorem 13. If the type annotated PTS judgment T h M : A is derivable, then M : A is a derivable 
judgment of the corresponding Too type theory. 

Proof. By induction on the size of the derivation of Y h M : A. We do a case split on the last rule used in 
the derivation: 

(sort) Immediate. 

(weak) Trivial, because if T,x.A is a type annotated context, then certainly T is a type annotated context. 

(var) By induction we have A : s derivable in Too, and because the context is type annotated the variable 
name must be of the form x A . Hence x A : A in Too. 

(conv) We know that r h M : A and r h B : s. Now A need not have bound variables of the form x, but 
one can rename them to obtain A' = A such that r h M : A' will be type annotated. Then M : A' and 
B : 5 in Too by induction and therefore also M : B in Too. 

(app) Again, we might need to change the bound variable in T h M : TlxA.B to the dotted kind, to get a 
type annotated judgment. Apart from that this case is trivial, just like the previous one. 

(IT) The conclusion will be of the form T h Tlx A.B : S3. Now if we take y 4 a completely fresh variable, 
then r, y 4 : A h B[x := y 4 ] : S2 will be a type annotated judgment, as well as being a-equivalent to 
T,x:AhB: s 2 . Accordingly, let B' := B[x := y 4 ], so that B = B'^ := x]. 

Clearly now y 4 hfvT(B') because r.y 4 : A h B' : S2, so all type annotations will be typable in T, 
which does not contain y 4 . 

By induction A : *i and B' : s 2 in Too and because y 4 ^ hfvT(B') we get that Tlx'A.B , \y A := i] : s^ 
in Tco. But this is precisely TlxA.B : S3. 

(A) This case essentially follows that of the previous one. 

□ 
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The other direction of our correspondence — from to traditional PTS style — is a bit more involved. 
We need to synthesise an appropriate context, and because we build this context by recursion over the 
type derivation, we need to merge these synthesised contexts using >< . For this we need a number of 
lemmas involving type annotated contexts in PTSs. 

Lemma 14. IfT and A are type annotated contexts, then T \ \ | A. 

Proof. If xr G dom(r) n dom(A), then xr : A G T and x A : A G A, of course, A = A and this is what we 
need to prove according to Definition [4] □ 

Lemma 15. IfT is a type annotated context with xr G dom(r), r h M : B andx A ^ hfvT(M,B), then 

3A C r(A,x A :A\-M:B) 

Proof. Write T = T u x A : A,T 2 , and suppose y c G dom(r 2 ) with r 4 G FV(C). If y c G FV(M,B), then 
jc 4 G hfvT(M,B), contradiction. Soy c FV(M,B). This means that all declarations y c : CinL^ for which 
x A G FV(C) can be removed by Strengthening (Proposition [3]), starting from the rightmost declaration in 
T 2 . We end up with a judgment 

T h xr :A,r' 2 hM:fi 

with T' 2 C r 2 and ^ type(r' 2 ). Using Permutation (Proposition we conclude that Ti^r^x^ : A h 
M : B and we take H , V 2 for A. □ 

Corollary 16. If T \- M : B is a type annotated judgment, there is a A C T such that A h M : A a«<i 
dom(A) Chfv(M,B). 



Proo/ Let x A G dom(r) and ^ hfv(M,B). Then ^ hfvT(M,B), so (according to Lemma 15 ), there 
is a A C T such that A,^ :AhM:S. But also x A £ FV(M,B), so by Strengthening (Proposition [3]), 
ArM.B. □ 

So, in r h M : B, we can always make the context T so small that its domain is within the set of 
hereditarily free variables of M,B. The other way around, the hereditarily free variables of M,B should 
be in dom(r): 

Lemma 17. IfT h M : B is type annotated, then hfv(M,B) C dom(r). 

Proof. We prove rhM:8=> hfv(M) C dom(r), by induction on the derivation and then we are done, 
because if T h M : B, then r h B : s for some sort 5, or B is a sort. 

(sort) Immediate. 

(var) By induction, hfv(A) C dom(r), so hfv(x A ) C dom(r,x A : A). 

(conv) By induction, hfv(M) C dom(r), so we are done. 

(app) By induction, hfv(F) C dom(r) and hfv(M) C dom(r), so hfv(FM) C dom(r). 

(n) By induction, hfv(A) C dom(r) and hfv(B) C dom(r,/ : A), so hfv(rii:A.B[y A :=x\) C dom(r). 

(A) By induction, hfv(M) C dom(r,y 4 : A) and hfv(rii:A.B[y 4 :=x]) C dom(r), so hfv(A) c dom(r), 
and hence Mv(Xx:A.M\y A := x}) C dom(r). 

□ 

The more difficult direction of our correspondence now follows: 
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Lemma 18. Let M : Abe a derivable Too judgment. Then all free variables in M and A have the form x A 
and all bound variables have the form x. 

Proof. By induction on the derivation of M : A. □ 

Theorem 19. Let M :A be a derivable Too judgment. Then there is a type annotated judgment T h M : A 
derivable in the associated PTS, such that T contains exactly the variables in hfv(M) Uhfv(A). 

Proof. By induction on the derivation of M : A we show there exists a type annotated context T(M,A) 
such that T(M,A) h M : A. Note that T(M,A) depends on the derivation of the judgment M : A, not just 
on the terms M and A. 

(sort) Immediate. 

(var) By induction, T(A,s) \- A:s. So T(A,5),r 4 :A ] rx A :A. 

(conv) By induction, T(M,A) h M :A and s)h B : s, and we also know that A =a B. 
So r(M,A) x r(B, j) h M : fi by Thinning and the (conv) rule. 

(app) By induction, T(F,Ux:A.B) h F : ITiA.fi and T(M,A) h M : A. 

So r( J F,ni:A.B) X T(M,A) h FM : B[i := M] by Thinning and the (app) rule. 

(IT) By induction, r(A, s\) h A: si and T(B,S2) \~ B : s 2 . 

If y 1 ^ T(B,s 2 ), then T(A,si) x I^fi,^),/ 1 : A,- h B : s 2 , so by Thinning and the (IT) rule we have 
T(A,si)tKT(B,s 2 )\-Ux:A.B\y A :=x] :s 3 . 

If y 4 G dom(r(B, j 2 )), then A,/ : A h B : s 2 for some A C r(B,s 2 ). So by Thinning and the (II) 
rule we have r(A, si) x A h ITxiA.B^ := x] : s^. 

(A) By induction, we obtain T(M,B) \- M : B and r(Tlx:A.B{y A :=x],j) h ITi:A.B[/ :=i] : 5. 

If y 4 i dom(r(M 5 B)), then T(A,s{) x T(M,B),y 4 :A,hM:fi. So by Thinning and the (A) rule, 
we have T{Y\x:A.B\y A := jc],j) X T(M,B) h Ai:A.M[/ :=i] : ITrA.fify 4 :=i] 



If / G dom(r(M,B)), then A,/ : A h M : B for some A C T(M,B), by Lemma[l5| So by Thinning 
and the (A) rule, we have r(ni:A.B[/ := x],s) x A h Ai:A.M[/ := x] : Ili^.B^ := x]. 



By Lemma [17| dom(r(M,A)) D hfv(M,A). Corollary [16| lets us strengthen the context to A C T(M,A) 
such that A h M : A and dom(A) = hfv(M,A). □ 

Corollary 20. Let 

M : A 

Z?e a derivable Too judgment. Take all variables of the form xf occurring in hfv(M) Uhfv(A), and put 
them in any order xf t [ , . . . ,xf^' such that ifx^ occurs in Ai then k < I. Then the following judgment is 
derivable in the PTS: 

xj 1 :A 1 ,...,xf; :A n \-M:A 
Proof. From the previous Theorem using Permutation. □ 
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5 Conclusion and Further work 

There are three obvious continuations of this work: 

1. The first is to investigate to what extent other type theories than the PTSs admit a Too presentation. 

2. The second is to see how well the approach presented here can be used as a basis of an LCF-style 
kernel for type theory. 

3. The third is to formally develop the theory presented in this paper in a proof assistant. 

With respect to the first point: we expect most type theories to have a r^-variant, although the obser- 
vations about universe inconsistency [19] arising from merging contexts may complicate the picture for 
applied type systems such as that of Coq. More important is to investigate how our approach needs to be 
adapted to support type theories with definitions. As previously noted, in any real implementation, the 
definitions for defined constants form a more significant part of the contexts T we are eliminating than 
the free variables. 

We are currently investigating the second point, developing an OCaml implementation for the PTS 
XP extended with definitions (a system corresponding to the logical framework LF) along the lines of 
this paper. The main question is how expensive, computationally, the two following operations are: 

• The substitutions [y 4 := i] that occur in the X and IT rules. 

• The check of the side-condition y 4 ^ hfvT(M,B) in the X and IT rules. 

The first is in some sense 'local', because it does not look inside constant definitions in the environment. 
To make the second reasonably efficient will be harder. It is possible we need to consider three kinds 
of variables, distinguishing (1) i bound variables, (2) y 4 variables to be substituted with bound variables 
later (essentially, the eigenvariables of the (IT) and (A) rules), and (3) ^ variables that do remain free, 
so they may be considered as 'axiomatic constants' of the system. 

In such a system, it is essential that the implementation language can use pointer equality to effi- 
ciently determine equality of terms (and in particular, equality of annotations in variable occurrences). 
This motivated our choice of OCaml, which is such a language. Although in OCaml the comparison 
function "=" does not have this feature (because floating points NaNs are not taken to be equal to them- 
selves, the system never looks at pointer equality when evaluating "x = y"), the comparison function 
"fun x y -> Pervasives . compare x y = 0" does. 

We are currently working on the third point as well. In ongoing work [10] the second author has 
formalised a large part of the theory presented in this paper up to the first direction of the correspondence 
theorem in the proof assistant Coq. However, the presentation in this formal development is slightly 
different from that of this paper. 

• Firstly, we distinguish bound from free variables at the level of PTS pseudo-terms, following 
existing practice, established since the third author's work with Pollack in the 1990s |[T4l [TBI . 
Our informal presentation above uses named variables in each case, the so-called locally named 
approach, whereas the formalisation uses the locally nameless representation: de Bruijn indices 
for bound variables and names for free variables. Because we make this difference at the level of 
PTS pseudo-terms already, we have a canonical representative for each term and therefore need 
not worry about a-equivalence. For further details of both approaches, see |[T5l [Tl for example]. 

• Secondly, variable binding in the (IT) and (A) rules of Too is handled by substituting free variables 
for bound variables, rather than bound for free as in Section[3j This choice has been used success- 
fully in other formalisations {ibid.), and emphasises the conceptual priority of free variables over 
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bound. Recent work by Pollack and Sato compares the two approaches, in a detailed account of 
canonical representation for languages with binding [20]. 

Based on the methodology described in HI, we have combined the locally nameless presentation with 
co-finite quantification to obtain strong induction principles. To be sure that our co-finite presentation is 
adequate we have proved it to be equivalent to an exists-fresh presentation. 

For example the (II) rule, in respectively exists-fresh and co-finite presentation, is as follows. 

A : s\ B\0 := x A ] : s 2 
(11 exists-fresh) - - if {s u s 2 ,s 3 ) e&andx*^ hfv(fi) 

riA.fi : s 3 

A:si \/x<£L .B[0:=x A ]:s 2 , x „ 

(n co-finite) i if (si, s 2 , s 3 ) e ffl and L C finite f 

ITA.fi : St, 

Observe that we require x 71 ^ hfv(fi) instead of X 4 ^ hfvT(fi) (this was also observed by John Boyland). 
The reason for this is that we have to bind every occurrence of the free variable x A in fi[0 := xr], hence x A 
should not be in FV(fi) either. While the condition is (potentially) more expensive to check, it removes 
the need for Definition [8j in favour of the (conceptually) simpler Definition [7] 

The other direction of the correspondence theorem uses the second property, Strengthening, of Propo- 
sition [3] in an essential way. This presents two difficulties: a practical one, since the existing formalisa- 
tions of this lemma are highly non-trivial [15]; and a theoretical one, namely that we may not be able to 
establish a correspondence between traditional and presentations of a given type theory without first 
establishing strengthening. 

More interestingly, from the point of view of the pragmatics of formalisation, for this direction it is 
essential to abstract over the kinds of free variables used in the definition of PTS judgments. At first 
this does not seem troublesome, however, many definitions and theorems do not just depend on the kind 
of free variables but also on finite sets of free variables. Hence we are also required to abstract over 
various operations on such finite sets. The recently developed finite set library fT3l . based on the new 
type classes feature in Coq, might be very useful in implementing this abstraction. 
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